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SYSTEM AND METHOD FOR ROBUST LOSSLESS DATA HIDING AND 
RECOVERING FROM THE INTEGER WAVELET REPRESENTATION 

BACKGROUND ART 

The present invention relates to methods and apparatus for embedding data in an 
image and recovering the embedded data therefrom. 

Data hiding, also known as data concealing, is a process for embedding useful data 
(information) into a cover media, such as image data. Cover media with data embedded 
therein is referred to herein as "marked media." Data hiding may be employed for the 
purposes of identification, annotation, copyright protection, and authentication. In such 
applications, the hidden data and the cover media may be closely related. This type of data 
embedding is often referred to as watermarking or more generally as "marking." It is desirable 
for the hidden data to be perceptually transparent. Otherwise stated, the marked media should 
resemble the cover media as closely as possible. 

The cover media will generally experience some distortion due to the presence of 
embedded data. Moreover, even after the embedded data are removed, it is generally difficult 
to restore the cover media to the condition it was in prior to embedding the data. Specifically, 
some permanent distortion of the cover media generally remains even after the hidden data 
have been extracted. The sources of distortion include round-off error, truncation error, and 
quantization error. This distortion presents a problem since for some applications such as 
medical diagnosis and law enforcement, it is important to accurately restore the cover media to 
the pre-embedding condition once the hidden data have been retrieved. The marking 
techniques satisfying this requirement are referred to as lossless or distortionless. Such 
marking techniques are also known as reversible marking techniques and are generally suitable 
for applications where the original media data should be accurately recovered. 

Recently, some lossless marking techniques have been reported in the literature. 
The first method is carried out in the image spatial domain. See U.S. Patent 6,278,791, issued 
August 21, 2001, entitled "Lossless Recovery of an Original Image Containing Embedded 
Data," by C. W. Honsinger, P. Jones, M. Rabbani, and J. C. Stoffel (referred to herein as 
"Honsinger"), the disclosure of which is hereby incorporated herein by reference. 

Another spatial domain technique was reported in Fridrich. J. Fridrich, M. Goljan 
and R. Du, "Invertible authentication," Proc. SPIE, Security and Watermarking of Multimedia 
Contents, San Jose, CA, January (2001) (referred to herein as "Fridrich"). The entire 
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5 disclosure of this document is hereby incorporated herein by reference. There also exists a 
distortionless marking technique in the transform domain. B. Macq and F. Deweyand, 
"Trusted headers for medical images," DFG VIII-D II Watermarking Workshop, Erlangen, 
Germany, Oct. 1999 (referred to herein as "Macq"). The entire disclosure of this document is 
hereby incorporated herein by reference. 

10 The capacity of the method reported in "De Vleeschouwer" is also very limited 

except that it exhibits robustness against high quality JPEG compression. C. De Vleeschouwer, 
J. F. Delaigle and B. Macq, "Circular interpretation on histogram for reversible watermarking," 
IEEE International Multimedia Signal Processing Workshop, Cannes, France, pp.345-350, 
October 2001 (referred to herein as "De Vleeschouwer") The entire disclosure of this 

15 document is hereby incorporated herein by reference. These techniques are directed to 
authentication rather than to data hiding, and the total quantity of data embedded in the cover 
media is therefore limited. 

The first lossless marking technique that is suitable for high embedding rate data 
hiding was presented in Goljan, M. Goljan, J. Fridrich, and R. Du, "Distortion-free data 

20 embedding," Proceedings of 4 th Information Hiding Workshop, Pittsburgh, PA, April, 2001 
(referred to herein as Goljan), the entire disclosure of which document is hereby incorporated 
herein by reference. In Goljan, the pixels in an image are divided into non-overlapped blocks, 
each block consisting of a number of adjacent pixels. For instance, this block could be a 
horizontal block having four consecutive pixels. A discrimination function is established to 

25 classify the blocks into three different categories: Regular, Singular, and Unusable. The 
authors used the discrimination function to capture the smoothness of the groups. 

An invertible operation can be applied to groups. Specifically, the invertible 
operation can map a gray-level value to another gray-level value. This operation is invertible 
since applying it to a gray level value twice produces the original gray level value. This 

30 invertible operation is therefore called "flipping." For typical images, flipping with a small 
amplitude will lead to an increase of the discrimination function, resulting in more regular 
groups and fewer singular groups. It is this bias that enables distortionless data hiding. While 
this approach hides data without distorting the cover data, the quantity of data which may be 
hidden employing this technique is still not large enough for certain applications. The payload 

35 was estimated to be in a range from 3,000 bits to 24,000 bits fora512x512x8 gray image 
according to Goljan. Another problem with the method is that as the quantity of data 
embedded in the image increases, the visual quality of the image decreases. For instance, the 
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5 PSNR (Peak Signal to Noise Ratio) may drop as low as 35 dB (decibels), and some undesired 
artifacts may appear in the image. 

A method by Xuan, based on the integer wavelet transform, is a recently proposed 
reversible data hiding technique that can embed a large quantity of data. Guorong Xuan, 
Jidong Chen, Jiang Zhu, Yun Q. Shi, Zhicheng Ni, Wei Su, "Distortionless Data Hiding Based 

10 on Integer Wavelet Transform," IEEE International Workshop on Multimedia Signal 
Processing, St. Thomas, US Virgin islands, December 2002 (referred to herein as "Xuan"). 
This document is hereby incorporated herein by reference. The main idea in Xuan is as 
follows. After the integer wavelet transform is applied to the original image, the bias between 
binary ones and binary zeroes in the bit-planes of the sub-bands LH, HL, and HH is 

15 significantly increased. Hence, the ones and zeroes in these bit planes can be losslessly 
compressed to leave a lot of storage space for data embedding. After data embedding, an 
inverse integer wavelet transform is applied to form the marked image. The capacity achieved 
in this technique is quite large. The PSNR of the marked image is, however, not high due to the 
histogram modification applied in order to avoid overflow or underflow conditions. For some 

20 images, the PSNR is only 28 dB. 

One method based on histogram manipulation is a recently disclosed lossless data 
hiding technique, which can embed a large amount of data (5k-80k bits for a 5 12 x 512 x 8 
grayscale image) while preserving high visual quality (the PSNR is guaranteed to be above 48 
dB) for a vast majority of images. Z. Ni, Y. Q. Slii, N. Ansari and W. Su, "Reversible data 
25 hiding," IEEE International Symposium on Circuits and Systems, May 2003, Bangkok, 
Thailand, (referred to herein as "Ni"). This document is hereby incorporated herein by 
reference. This document (Ni) is not conceded to be prior art merely through its placement in 
the "Background of the Invention" section of this patent application. 

However, only one prior lossless data hiding technique (De Vleeschouwer) is robust 
30 against compression applied to the stego-image (the image including the embedded data). 
Specifically, only in Vleeschouwer can the hidden data still be extracted out correctly after the 
stego-media has gone through compression. With the other existing techniques, the embedded 
data cannot be recovered without error after stego-media compression. 

With reference to FIGS. 1A and B, while the De Vleeschouwer technique is robust 
35 to compression, it generates annoying salt-and-pepper noise because it uses modulo-256 
addition. That is, when the pixel grayscale value is close to 256 (brightest) and/or 0 (darkest), 
the modulo-256 addition will likely cause flipping over between the brightest and darkest gray 
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5 values. This often happens with medical images. This type of salt-pepper noise is 
unacceptable for many applications. 

Therefore, there is a need in the art for a system and method for embedding an 
amount of robust data into cover media in a reversible manner (the original cover media can be 
reserved without salt-and-pepper noise). 

10 

DISCLOSURE OF THE INVENTION 

A method is provided which comprises providing a block of IWT (integer wavelet 
transform) coefficients for at least one frequency sub-band of an image; determining a mean 
value of said coefficients within the block; and establishing an encoded mean value to embed 
15 one of a logical-0 bit value and a logical- 1 bit into the first block. 

Other aspects, features, advantages, etc. will become apparent to one skilled in the 
art when the description of the preferred embodiments of the invention herein is taken in 
conjunction with the accompanying drawings. 



20 BRIEF DESCRIPTION OF THE DRAWINGS 

For the purposes of illustrating the various aspects of the invention, there are shown 
in the drawings forms that are presently preferred, it being understood, however, that the 
invention is not limited to the precise arrangements and instrumentalities shown. 

FIGS. 1 A-B are images illustrating an undesirable salt-and-pepper effect that occurs 
25 using conventional data hiding techniques; 

FIG. 2 is a graph illustrating a histogram of IWT coefficients in the HLI sub-band 
for a test image (red color plane of a JPEG2000 test image, called Nl A) suitable for marking in 
accordance with one or more aspects of the present invention; 

FIG. 3 is a graph illustrating a histogram of mean values of coefficients of 10X10 
30 blocks in the HL1 sub-band of the N1A test image suitable for marking in accordance with one 
or more aspects of the present invention; 

FIG. 4 is a matrix including information identifying a mask for use in determining 
which of a group of IWT coefficients are to be modified during a data embedding operation 
conducted in accordance with one or more aspects of the present invention; 
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5 FIGS. 5A-5D are histograms showing distributions of spatial-domain grayscale 

values for pixels of a block of an image for four respective categories of blocks for marking in 
accordance with one or more aspects of the present invention; 

FIG. 6 is a data table showing the filter coefficients of a 5/3 wavelet filter suitable 
for use in accordance with one or more aspects of the present invention: 

10 FIG. 7 illustrates a matrix showing the unit step response in the spatial domain for a 

change in the value of an IWT coefficient at the (i, j) position in the HL1 sub-band in 
accordance with one or more aspects of the present invention; 

FIG. 8 illustrates a matrix showing the unit step response in the spatial domain for a 
change in the value of an IWT coefficient at the (i, j) position in the LHi sub-band in 
15 accordance with one or more aspects of the present invention; 

FIG. 9 illustrates a matrix showing the unit step response in the spatial domain for a 
change in the value of an IWT coefficient at the (i, j) position in the LLi sub-band in 
accordance with one or more aspects of the present invention; 

FIG. 10 is a block diagram showing the relationship between IWT coefficients and 
20 groups of affected pixels in the spatial domain in accordance with one or more aspects of the 
present invention; 

FIG. 1 1 is a matrix illustrating one quarter of the unit step response of CllOj) and 
one quarter of the unit step response of C L L(ij +1 ) in accordance with one or more aspects of 
the present invention: 

25 FIG. 12 is a graph illustrating a data embedding rule for a block of type A in 

accordance with one or more aspects of the present invention; 

FIG. 13 is a graph illustrating a data embedding rule for a block of type B in 
accordance with one or more aspects of the present invention; 

FIG. 14 is a graph illustrating a data embedding rule for a block of type C in 
30 accordance with one or more aspects of the present invention; 

FIG. 15 is a block diagram of a system and method for data embedding in 
accordance with one or more aspects of the present invention; 

" FIG. 16 is a flow diagram for a method for restoring cover media data in 
accordance with one or more aspects of the present invention; 
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5 FIG. 17 is an illustration of a medical image that has been marked in accordance 

with preferred data embedding method in accordance with one or more aspects of the present 
invention; 

FIG. 18 is an illustration of a test (N1A) image that has been marked in accordance 
with preferred data embedding method in accordance with one or more aspects of the present 
10 invention; 

FIG. 19 is ' a data table of experimental results correlating block size and data 
storage capacity when marking an image employing a method in accordance with one or more 
aspects of the present invention; 

FIG. 20 is a data table of experimental results illustrating the effect of block size on 
1 5 mean block coefficient value in accordance with one or more aspects of the present invention; 

FIG. 21 illustrates an image that has been marked using the variables block size = 5 
and shift quantity = 8, yielding a PSNR (peak signal to noise ratio) of 40.09 dB (decibels) in 
accordance with one or more aspects of the present invention; 

FIG. 22 illustrates an image that has been marked using the variables block size = 
20 12 and shift quantity = 2, yielding a PSNR of 49.46 dB in accordance with one or more aspects 
of the present invention; 

FIG. 23 is a data table showing values of PSNR occurring with various 
combinations of values of block size and shift quantity in accordance with one or more aspects 
of the present invention; 

25 FIG. 24 is a graph plotting data survival rate versus PSNR for various block sizes in 

accordance with one or more aspects of the present invention; 

FIG. 25 is a data table showing test results employing a 3X3 median filter in 
accordance with one or more aspects of the present invention; and 

FIG. 26 is a data table showing test results in the presence of additive Gaussian 
30 noise having a zero mean and a variance of 0.001 in accordance with one or more aspects of 
the present invention. 

BEST MODE OF CARRYING OUT THE INVENTION 

We have studied the features of image wavelet transforms and found out that 
35 coefficients of the high frequency sub-bands (HL n , LH n , or HH n ) have a zero-mean and 
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5 Laplacian-like distribution. An example of this is the HLi sub-band of a N1A JPEG2000 test 
image, called N1A, depicted in FIG. 2 where the X-axis represents the IWT (Integer Wavelet 
Transform) coefficient values, while the Y-axis represents the numbers of coefficients that 
assume the respective coefficient values. We further divide this sub-band into non-overlapping 
square blocks with the side length B and calculate the mean of the coefficients values in each 
10 block. 

MeanV=±±±C? (1) 

t> i=l y=l 

where the superscript (t) indicates the t th block, and Cl° denotes the IWT 

coefficients of the t th block. FIG. 3 illustrates one example of the distribution of these mean 
values. In FIG. 3, the X axis represents the mean values of the 10x 10 non-overlapping blocks 
15 (i.e. side length B = 10) in the HLi sub-band of the N1A JPEG2000 test image, and the Y axis 
represents the numbers of blocks that have the respective mean values. It can be observed that 
the variance of the mean values shown in FIG. 3 is much smaller than the variance of the 
coefficients themselves. 

One aspect of our proposed semi-fragile lossless data hiding algorithm is to exploit 
20 this property. One bit of data will preferably be embedded into each of the non-overlapping 
blocks. At first, we scan all the blocks and find out the maximum absolute mean value of 
coefficients, denoted by m max . 

m maJ c = Max§ mean ^ I) ( 2 ) 

Again, "t" denotes the block index, and Max denotes the operation to take 
25 maximum value. A threshold T is set to be the smallest integer number which is no less than 
the m max . As a result, all blocks' mean values' absolute values will be less than or equal to T. 
Now, one bit can be embedded into each of the non-overlapping blocks by establishing (or 
manipulating) mean values of each of the blocks. If a logical- 1 bit value is to be embedded 
into a block, the mean value of the block will be shifted in the direction away from 0 by a shift 
30 quantity S, which is preferably equal to or larger than T. If a logical-0 bit value is to be 
embedded, the block and the mean value of the IWT coefficients in the block are preferably 
left unchanged. Therefore, a mean value of coefficients of a block having an absolute value 
larger than T preferably represents a hidden binary "1" in the block. A mean coefficient value 
smaller than T preferably represents an embedded binary c 0\ 
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5 Preferably, since the shift quantity S is fixed for all blocks, the original coefficients 

can be recovered by conducting the reverse operation, i.e., subtracting S from the IWT 
coefficients of blocks in which a logical- 1 bit value ("1" bit) is embedded. Thus, the 
embedding operation is preferably reversible. As a result, the cover media can be restored 
without error if the stego image has not been distorted. Moreover, since we embed data by 
10 controlling the mean value of the IWT coefficients in one block, minor changes to the image 
caused by unintentional attacks such as JPEG/JPEG2000 compression will not cause the mean 
value to change much. Therefore, the correct detection of the hidden data is preferably still 
possible even if the stego-image is slightly altered. We verify this claim later in this 
application. 

15 It should be pointed out that lossless watermarking is based on a principle of 

information theory that all original cover-media information should be preserved in the marked 
image throughout the lossless data hiding process. Therefore, if the marked images are 
subjected to lossy compression, the original cover media generally can no longer be recovered, 
since information about the original image may have been lost. It is also noted that we 

20 generally choose the HLi or LHi high frequency sub-bands to embed data. That is, though the 
mean value distribution of blocks of IWT coefficients of higher level high frequency sub- 
bands, HL n or LH n (n>l) also behave according to the theory presented above, we generally do 
not embed data in these high level IWT coefficients. This is because alterations to the higher 
level sub-bands can significantly degrade the image quality. 

25 Proposed lossless data hiding algorithm 

The issue of how to prevent overflow/underflow is addressed first. The proposed 
data embedding procedure is then presented. 

Origin of overflow/underflow. 

If all of the images we handle are in JPEG2000 format, the proposed method in the 
30 previous section would be sufficient. Let us assume that we have an image in the JPEG2000 
format. We can have all the IWT coefficients derived from the image file through tier-2 
decoding, followed by tier-1 decoding as disclosed in Christopoulos. C. Christopoulos, A. 
Skodras, T. Ebrahimi, "The JPEG2000 still image coding system: an overview," IEEE Trans, 
On Consumer Electronics, vol.46, no.4, pp.1 103-1 127, Nov. 2000 (referred to herein as 
35 "Christopoulos"). This document is hereby incorporated herein by reference. 

Then, the hidden data can be embedded into the high frequency IWT coefficients. 
The changed coefficients can be encoded again to form a marked JPEG2000 file which is 
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5 ready for applications such as transmission or printing. At the receiver end, the hidden data 
could be extracted from the marked JPEG2000 image file. If no alteration has been inflicted 
on the marked image, all the modified coefficients can preferably be shifted back to their 
original values, assuming that the data extraction procedure knows the value of the shift 
quantity used in the data embedding. We thus have the recovered version of the image which 

10 is preferably the same as the original one. This is because the JPEG2000 encoding preferably 
ensures that the wavelet coefficients are not truncated. See JPEG 2000 Part I Final Committee 
Draft Version 1.0. ISO/IEC JTC 1/SC 29/WG 1 N1646R Available: 
http://wwwjpeg.org/piiblic/fcdl5444-Lpdf, the entire disclosure of which is hereby 
incorporated herein by reference). Hence, overflow and/or underflow preferably do not take 

15 place. 

However, sometimes the stego-image may be converted into formats other than the 
JPEG2000 format. File format change should be considered permissible, and the original 
cover media should be preserved after the file format change. Nevertheless, we discovered that, 
sometimes, the recovered media may differ from the original media after the format changes 

20 from the JPEG2000 format to another format, such as, for instance, the bitmap format. This is 
because the pixel grayscale values in spatial domain are preferably represented by a fixed- 
length bit sequence (usually 8 bits, sometimes 16 bits). In a color image, each fundamental 
color component in a pixel is also often represented by 8 bits. After the IWT coefficients have 
been altered by the embedding operation, some pixels' grayscale values may be shifted out of 

25 the permitted grayscale value range as a result of the embedding operation. A typical grayscale 
range is 0-255 for an 8-bit grayscale image. 

This phenomenon is referred to as overflow/underflow, which is explained in the 
following. In our algorithm, some IWT coefficients are modified in the data hiding operation. 
And, this change in coefficient values, upon transforming the blocks into the spatial domain, 

30 causes a related change in the pixel grayscale values in the spatial domain. Moreover, this 
change in pixel grayscale values may exceed the typical grayscale value limit range, such as 0 
to 255, thereby causing an underflow condition and/or an overflow condition. Consequently, 
truncation generally occurs. Pixel grayscale values that are greater than 255 (overflow) are 
preferably set equal to 255, and those values that are smaller than zero (underflow) are 

35 preferably set equal to 0. From the information theory perspective, there is information loss in 
truncation, and thus the original image cannot be completely accurately recovered later, in the 
data extraction process. 
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5 Prevention of overflow/underflow. 

Generally, as long as the pixel grayscale values of the marked image in the spatial 
domain remain in the permitted range, there is no overflow/underflow problem discussed 
above. In the following, it is assumed that the pixel grayscale values will be represented by 8 
bits and that the permitted grayscale value range is [0,255]. If the permitted range is other than 

10 [0,255] for some images, the proposed principle remains valid, and the corresponding solution 
can be easily derived. 

A preferred embodiment of the method is block-based. Specifically, the method 
preferably splits the selected high-frequency sub-band into non-overlapping blocks, and the 
data hiding is , preferably carried out block by block. But the modification of the IWT 

15 coefficients in the blocks adjacent to a block under consideration can interfere with the 
grayscale values of the pixels corresponding to this block, which will become clear later in this 
document. Therefore, it is much easier for us to avoid the overflow/underflow if we can 
remove any interference from neighboring IWT-coefficient blocks. Therefore, the mask of FIG. 
4 is preferably employed. 

20 The outer elements on the bound of the mask are marked with '0's, and the inner 

elements by 6 Vs. In the mask, a c 0' identifies an IWT coefficient which will not be changed, 
while a '1' identifies a coefficient which will be changed during the data embedding process. 
In other words, at the bit embedding stage, in order to change the mean value of an IWT- 
coefficient block, we preferably only change those coefficients whose positions are associated 

25 with a 6 V bit in the above mask and preferably keep the coefficients intact in the outer bound 
of the block. Using the IWT theoiy of Christopoulos, it is readily shown that when the IWT 
coefficients in the selected blocks are changed using the mask as illustrated in FIG. 4, the 
grayscale values of pixels in adjacent blocks will not be affected. That is, the interference 
among neighboring blocks is preferably eliminated employing this approach. 

30 Once the interference among adjacent blocks has been removed, the discussion of 

preventing underflow/overflow can be focused on a single block. The basic idea is described 
in the following. Consider an IWT-coefficient block, B w , and its corresponding block in the 
spatial domain, B s . Here, the term "corresponding" refers to the group of pixels whose 
grayscale values are associated with the IWT-coefficient block. This correspondence relation 

35 can be readily determined according to the existing IWT theory. We assume that the absolute 
value of the maximum pixel grayscale value change is S max . In B s , overflow may occur if a) 
there are pixels having grayscale values greater than (255 - S max ) and b) the grayscale values 
need to be increased as a consequence of the bit embedding process. Underflow may take place 
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5 when there are pixels having grayscale values less than S max and the spatial-domain pixel 
grayscale values need to be decreased as a result of the bit-embedding process. 

To avoid overflow and underflow, the above two scenarios should not occur. We 
call [0, S max ] the 0-zone and (255-S ma x, 255) the 255-zone. In the spatial domain, we can 
classify a block into one of the four categories discussed below based on the placement of the 

10 pixel values of a block in relation to these two zones. In an ideal case, i.e. a type A block, no 
pixels are in the above-described zones, as illustrated in FIG. 5A. In this ideal case, we do not 
need to worry about the overflow/underflow problem. 

If in the B s there exist pixels in 0-zone but no pixel in 255-zone, i.e. a type B block, 
as is illustrated by the solid line in FIG. 5B, the underflow problem can be avoided if the pixel 

15 grayscale values are only changed in the positive direction. The resulting histogram after data 
hiding is shown by the dotted line in FIG. 5B. In FIG. 5B, no pixel has a grayscale value 
greater than 255 or smaller than zero, indicating that no overflow/underflow will occur. 

Preferably, with a type B block, if the mean coefficient value for the block is less 
than (i.e. more negative) than -T, the shift value is not applied to the block regardless of the 
20 value of the bit slated to be embedded in the block. 

FIG. 5C depicts a case opposite the one in FIG. 5B. If there is no pixel of this block 
in the 0-zone, i.e. a type C block, it will be safe if all changes to the pixels in this group are 
decreases. The worst case is described by FIG. 5D, where pixels in the 0-zone, and pixels in 
the 255-zone are both present in a single block, i.e. a type D block. In this case, the mean 
25 coefficient value is preferably not shifted to avoid underflow or overflow. Thus, errors in data 
extraction may take place and some additional measure has to be taken. Error code correction 
is one way to address any errors embedded in blocks for the reasons described above. 
Measures for addressing errors in data extraction will be discussed later in this paper. 

Prevention of overflow/underflow. 

30 In the above, we have proposed a preferred system and method for embedding bits 

into the IWT coefficients and have discussed how to avoid overflow/underflow by considering 
the effects on the grayscale values of embedding bits as described above. In a preferred 
embodiment, if we can find a way to embed one bit into one block of the HLi or the LHi sub- 
bands, and the grayscale values of affected spatial-domain pixels all move only in a direction 

35 in which no truncation is necessary to occur, the overflow/underflow problem can be avoided. 
In the following, this issue is explored further. 
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In the JPEG2000 standard, the 5/3 wavelet filter bank is generally used as the 



default filter for reversible image encoding. The 5/3 filter bank was first proposed by Le Gall 
et. al in L. Gall and A. Tabatabai, "Sub-band coding of digital images using symmetric short 
kernel filters and arithmetic coding techniques", Proc. of IEEE International Conference on 
Acoustics, Speech and Signal Processing, NY, pp.76 1-765, 1988 (referred to herein as "Gall"), 
the entire disclosure of which document is hereby incorporated herein by reference. The 
coefficients of the 5/3 wavelet filter bank are given in FIG. 6. 



transform (IWT) of an image. In this case, the effects of changing the wavelet coefficients on 
the spatial domain counterpart need to be investigated. 

From FIG. 6, the synthesis filter can be represented using the Z transform as 

follows. 

Low-pass filter: 



Our goal is to embed data into a high frequency sub-band of the integer wavelet 
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High-pass filter: 
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For a 2-D image, we have, 



Low-pass filter in x axis: 



(5) 



High-pass filter in x axis: 




(6) 



Low-pass filter in y axis: 




(7) 



High-pass filter in y axis: 
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10 



For the HL sub-band, the synthesis filter is to first apply the high-pass to the row (x 
direction), then apply low-pass to the column (y direction). The result is: 

G HL (z x9 z y ) = G Hx (z x )xG Ly (z y ) (9) 

= -* _ 2 , 6 _ 2 _ 1 2)X( I -i +1+ 1 } 
v 8 * 8 8 8 * 8 * 2 * 2' 

1 -2 -1 1 -2 1 -2 1 -1 -1 2 ! 1 I 

= -i6 z - ~s z * -ir- z ^8^ z ' ~r> -r- z > 

3 6 3 1 2 1 1 a -i 1 2 1 2„ 

+ i Z ^' + 8 + 8^"8 Z ^ -r'-I^'"^**' -8 Z *-TT* Z ' 



For the LH sub-band, the synthesis filter is to first applies low-pass filtering to the 
rows (x direction), then applies high-pass filtering to the columns (y direction). The result is: 

G L Az x ,z y ) = G^)xG Hy (z y ) (10) 

15 =(--z ~ 2 --z ' l +---z--z v 2 )x(-z x - l +l + -z x ) 

1 8 y 8 ' 8 8 " 8 y 2 x 2 x 

1 2 1 1 -2 1 -2 1 -1 -1 2 -1 1 -1 

*• s z > Ye Zy z *~8 Zy z * ~s Zy ~r> z * 

3 _, 6 3 1 2 1 1 2-1 1 2 1 2 7 

V* + i + 8 z *-i z ' z * vs'>'' "I''"**'*- 



For the LL sub-band, apply low-pass filter to both directions. The result is: 
G LL (z x ,z y ) = G u (z x )xG Ly (z y ) (11) 

20 = (| V' + 1 + x (| V 1 + 1 + \ z y> 



From Formula (9), the unit step response in the spatial domain for a change 
occurring to an IWT coefficient located at the (i, j) position in the HLi sub-band can be 
25 expressed with the matrix Ghl in FIG. 7. Specifically, an array of pixel grayscale values in the 
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spatial domain centered at (2i-l, 2j) with a size of 3x5 (refer to FIG. 10) will change by the 
amount specified by the elements of this matrix (FIG. 7) if the IWT coefficient (i, j) in the HLi 
sub-band is increased by 1 . 

The corresponding unit step responses can be found for the coefficients in LHi and 
LLi sub-bands, denoted by G L h and G L l, respectively, as shown in FIG. 8 and 9. They indicate 
the magnitude of changes of the grayscale value in the spatial domain if the IWT coefficient (i, 
j) in the LHi or LLi sub-band is increased by 1. The center is located at (2i, 2j-l) and (2i-l, 2j- 
1) for Glh and G L l, respectively. 

Assuming the HLi is the earner sub-band, FIG. 7 indicates that the pixels affected 
by wavelet coefficients change will experience changes in their grayscale values in both 
increasing and decreasing directions. Similarly, this is true if the LHi is used as the carrier 
sub-band. According to the matrix in FIG. 9, the pixel grayscale values will move in only one 
direction. One preferred approach to combating overflow/underflow here is to manipulate 
coefficients in the LL sub-band to cause the resulting spatial-domain pixel grayscale value 
change to occur in only one direction. 

First, let us consider a unit input into an HLI coefficient with position (i j), denoted 
by C HL (iJ) . We denote the output response in spatial domain by k(z, j) , where k represents 
the amplitude, and (i 9 j) indicates the coordinates of the point in the spatial domain that is 
affected. The unit input described above and its response in the spatial domain are represented 
in the next equation. 



C ffL (j,j) O 

-^(2i-2,2y-2)™(2^ 
16 o lo o 

-|(2/ - 1,2 j ~ 1) 2] - 1) + - 2,2 j) + |(2i - 1,2 j) 

8 8 8b ^ 

+1(2^2/) -1(2/ - 2,2 j + 1) - f (2/ - 1,2 j + 1) -4(21,27 + 1) 
8 o o o 

-4(2/ -2,2 j + 2) - 1(2/ -1,2./ + 2) " ^-(2i,2J + 2) 
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5 In the above equation, -— (2/-2,2y-2) means that -— is the response caused 

16 1° 

by the unit input to the HL1 coefficient at (i, j), (2/ - 2,2/ -2) means the response is located at 

(2i-2, 2j-2) in the spatial domain. 

If a unit input is applied to an LL coefficient positioned at (i, j), the corresponding 

response is: 

10 

1(2/ - 2,27 - 2) + -(2/ - 2 5 2j - 1) + j(2/ - 2,2 j) 
4 2 4 

+ - (2/ - 1, 2j - 2) + 1(2/ - 1, 2] - 1) + \ (2i = 1, 2/) (13) 
2 

+-(2z,2y - 2) + i(2i,27 - 1) + i(2i,2y) 
4 2 4 

We can verify that: 

15 C^)+^C^+ic LL (i,j+l) o \{2i - 2,27) + 1(2/ - W) + |(2^2j) (14) 

It can be seen that if we change an HL1 coefficient at (ij) by S and change both 
LL1 coefficients at (ij) and (ij+1) by S/4, the pixel value change in the spatial domain will 
have the same sign as S. 
20 Similarly, for a coefficient at LH sub-band, we can get 

C^+^C^U)+ic LL (i+1, j) o i(2/,27^2) + l(2/,2y-l) + |(2/,27) (15) 

The above formulae form the basis of our method which is described as follows. 
Now we study the unit response of LL coefficients. It can be deduced that for an LLi 
25 coefficient at (i, j), the center of the unit step response is at (2i-l, 2j-l) in the spatial domain, 
while the center of unit step response will be at (2i-l, 2j+l) for LLi coefficient (i, j+1). FIG. 
10 illustrates the relationship of IWT coefficients and a resulting unit step response in the 
spatial domain. The solid arrow and the solid box are for the coefficient (i, j) in the HLi sub- 
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5 band and the affected pixels in the spatial domain. The dotted arrow and dotted box are for 
LLi coefficient (i, j) and its affected pixels in the spatial domain. And, the dashed arrow and 
dashed box are for the LLi coefficient (i, j+1) and its affected pixels in the spatial domain. 

It is observed that the combined 2-D arrays corresponding to the unit step responses 
of LLi coefficients (i, j) and (i, j+1) have a size of 3x5 and cover the exactly same group of 
10 pixels corresponding to the unit step response of the IWT HLi coefficient (i 5 j). We use Cnhih 
j) to denote the coefficient at (i, j) of the HLi sub-band and C L L(i, j) to denote the coefficient at 
(i 5 j) of the LLi sub-band. Then, it is straightforward to verify that the combined effect of the 
unit step response of C H L(i, j), the % of the unit step response of C LL (i, j), and the Va of the unit 
step response of CuiU j+1) can be expressed by the matrix in FIG. 11. 

15 The matrix of FIG, 11 indicates that all the changes in the spatial domain will 

preferably have the same sign. In other words, if we decide to embed data in the HLi sub- 
band, and if we change an HLi coefficient at (ij) by S and change both LLi coefficients at (ij) 
and (i by S/4, the pixel grayscale value change in the spatial domain will have the same 
sign as S. It is noted that after the embedding, the maximum pixel grayscale value change in 

20 the spatial domain is also S. Similarly, if the LHi serves as the carrier sub-band, the same goal 
can be achieved by changing the LHi coefficient at (i j) by S and both the LLi coefficients at 
(ij) and (i+1 j) by S/4. In the next section, in which we present the proposed data embedding 
procedure, it becomes clear how overflow and underflow can be avoided. 

Data Embedding 

25 In a preferred embodiment, data can be embedded into sub-band HLI or sub-band 

LHI. We will explain our method assuming HLI is used. At first, HLI sub-band is divided 
into non-overlapping blocks having a size N coefficients by N coefficients. In each block, the 
mean value is either 0 or close to zero. Equivalently, the absolute value of the mean should be 
less than a threshold C T\ The basic idea is if a bit 6 T is going to be embedded, the absolute 

30 mean value of the block will preferably be shifted away from 0 by a quantity larger than C T\ 
Here, we do not change every coefficient of this block. Instead, we use the mask of FIG. 4. In 
this example, we only change those coefficients whose positions are marked with ' V in the 
above mask. In other words, we keep the outer bound of the block unchanged. By doing this, 
the effects of changing coefficients in one block will not interfere with the effects of changing 

35 coefficients in adjacent blocks. 

If the coefficient at (i, j) in the HLi sub-band is changed by a quantity S (S can be 
either positive or negative), we will modify two coefficients of LLi sub-bands at (i, j) and (i, 

16 



WO 2005/057905 



PCT/US2004/040442 



5 j+1) each by a quantity S/4. As a result, all the affected pixels in the corresponding 3x5 array 
will either increase (if S>0) or decrease (if S<0). Alternatively, if the LHj sub-band is used as 
the embedding carrier and we modify the coefficient at (i, j) in LHi sub-band by S, we need to 
change the two corresponding coefficients of LLi sub-bands at (i, j) and (i+1, j) by S/4. 
Similarly, all the affected pixels in the spatial domain will either increase (if S>0) or decrease 
10 (is SO). As mentioned earlier in this document, in our algorithm, mean value change of a 
block is preferably achieved by changing all the coefficients with a mask value of 6 1' in FIG. 4 
by the same amount. For each coefficient in the HLi or LHi sub-band whose value needs to be 
changed by S, we not only change its value by S, but also change two corresponding 
coefficients in the LLi sub-band by S/4 as discussed above. 

15 Before the embedding process, error correction coding and chaotic mixing are 

preferably applied to the information bits to be embedded. The purpose of applying these two 
techniques in the preferred method is to improve the robustness of the method. The coded bit 
stream and the cover image are the input to the embedding system and method. 

When embedding a bit into a block of IWT coefficients in HLi or LHi sub-band, 
20 B W5 we first find out the corresponding spatial block that could possibly be affected by the 
embedding, B s . Then the pixel grayscale values of B s are preferably checked, and the block is 
preferably classified into four categories illustrated in FIG. 5. 

A: The block has no pixel grayscale values greater than (255-Smax), namely 255-zone, or 
smaller than S max , i.e., 0-zone. 
25 B: The block has pixel grayscale values smaller than S ma x but no pixel values greater than 

(255-S m ax)- 

C: The block has pixel grayscale values greater than (255-S max ) but no pixel values smaller 
than S max 

D: The block has both pixel grayscale values smaller than S max and greater than (255-Smax) 

30 Different operations are applied to the different categories, which is one of the key 

points of this embodiment of the invention. The operations are presented below. 

For blocks of type A: 

In this case, overflow and underflow will not take place. We can either increase the 
mean value or decrease the mean value of the carrier sub-band (HLi or LHi) coefficients in the 
35 block as discussed below. The mean value may be greater than zero or less than zero as is 
depicted in FIG. 5A. The detail is illustrated in FIG. 12. If a '1' bit is to be embedded, the 
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5 mean value will be shifted away from 0 by the shift quantity S. Where the mean value is 
initially at a, it is preferably shifted to a'. Where the mean value is initially at b, it is preferably 
shifted to b\ In a preferred embodiment, we modify the corresponding coefficients in the LL 
sub-band by S/4 as described earlier in this document. For bit '0 5 , the mean of the coefficients 
of this block will preferably remain unchanged. 

1 0 For blocks of type B. 

In a preferred embodiment, the blocks of type B have some pixel grayscale values 
only in the 0-zone as illustrated in FIG. 5B. Overflow will not occur for this type of blocks. To 
embed a bit '0', the block is preferably left unchanged. To embed a bit 6 V, the absolute value 
of the mean is preferably shifted by S. If the original mean value is greater than or equal to 

15 zero, as is the case of -a- in FIG. 13, the mean can be shifted to -a'-. Besides, we modify the 
corresponding coefficients in the LL sub-band by S/4. As a result, the pixel values in the 
spatial domain will only increase after changes are made. Therefore, underflow is avoided. If 
the mean value is negative, as with -b-, to make the absolute value of the mean value greater 
than T, the mean value can only be subtracted from by the amount of the shift quantity S. 

20 However, underflow may happen as this subtraction is performed. Hence, the mean value -b- 
preferably is not shifted. But, leaving the block unchanged corresponds to embedding a bit '0'. 
Therefore, error correction coding is preferably employed to correct any resulting errors. 

For blocks of Type C. 

In a preferred embodiment, the blocks of this type have some pixel grayscale values 
25 only in the 255-zone as illustrated in FIG. 5C. This case is similar to the previous one except 
everything is opposite to the type B blocks. To embed bit c 0', no operation is needed. To 
embed a bit c l 5 , only the blocks with a negative mean coefficient value can be used to hide bit 
c l'. If bit ' 1' is to be embedded and the mean value of the block is positive, we will not be 
able to embed the bit. Instead, in this situation, the block is preferably left unchanged, which is 
30 equivalent to embedding a "0" bit. Thus, an erroneous bit may be embedded. Preferably, error 
correction coding (ECC) and/or error correction decoding (ECD) are employed to correct any 
such bit errors. FIG. 14 illustrates the detail with respect to the mean values for a block of type 
C. 

For blocks of Type D. 

35 Generally, this type of block simultaneously has some pixel grayscale values in the 

0-zone and some grayscale values in the 255-zone, as is illustrated in FIG. 5D. Shifting the 
mean value in either direction will generally cause either overflow or underflow. If an 
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5 information bit c 0' is to be embedded here, there is no problem. However, if we need to embed 
a "1" bit, an error will occur. Again, we rely on ECC and/or ECD to rectify any resulting error 
bits. 

It is noted that for frequently used images, most blocks belong to type A. For 
example, all blocks are type A blocks for the popular "Lena" image, marked versions of which 
10 are shown in FIGS. 21-22. For this types of blocks, ECC may not need to be used to correct 
the errors. 

In some cases, the errors may cluster together. This is called "burst error" in the 
literature. That is, blocks of type B, C, D may be concentrated in some region(s) of a given 
image. In this situation, ECC may not be able to efficiently correct the errors. Chaotic mixing 
15 (or other type of permutation) is preferably introduced to combat burst errors. It can disperse 
the burst errors evenly. FIG. 15 illustrates the block diagram of a method for data embedding 
in accordance with a preferred embodiment of the present invention. 

In a preferred embodiment, the block size B, the threshold T, the shift quantity S, 
and an identification of the sub-band chosen as the carrier are saved as side information which 

20 can be later used for data extraction and recovery of cover media. From a different 
perspective, the side information can be treated as a key. In other words, it is preferably 
transmitted through a separate, secure channel. Preferably, only authorized persons can extract 
the hidden data with the provided side information. An alternative method is to use the same 
B, T, S, and sub-band for all images. As a result, the side information will preferably not be 

25 needed. The drawback of this alternative method is that the parameters may not be the optimal 
for an image processed in this manner. 

Data Extraction And Original Image Recovering. 

At the receiver side, a stego image is the input. Data extraction and original cover 
image restoration will be performed. 

30 Hidden data extraction 

In a preferred embodiment, the hidden data extraction needs the stego-image and/or 
the side information. Preferably, the high frequency integer wavelet coefficients of the stego- 
image are first obtained. Then the carrier sub-band is preferably divided into non-overlapping 
blocks of size B. Preferably, one bit of data is extracted from each block. 

35 The comparison of the mean value with the threshold T can decide whether a bit '0 ' 

or a bit 'V is extracted. It is noted that if the test image is in the JPEG2000 format, the IWT 
coefficient can be obtained directly from the JPEG2000 bitstream without performing wavelet 
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5 transform. Preferably, the data extraction will be efficient since no extra transformations are 
needed. 

Hidden data extraction is preferably easier than embedding. In the IWT domain, if 
the data is embedded in the HL1 sub-band, we divide the sub-band into blocks in the same way 
as in the embedding stage. Then the mean value of the coefficients of each block is preferably 
10 calculated. 

Preferably, for each block, the method then extracts one bit for the block and 
restores the mean coefficient value for the block to its original value. Specifically, , the method 
preferably extracts a logical- 1 bit if the absolute value of the post embedding, pre-extraction 
mean coefficient value is equal to or greater than the threshold T. Preferably, the method 

15 extracts a logical-0 bit value if the post-embedding, pre-extraction mean coefficient value for 
the block is less than the threshold T. Therefore, at the data extraction stage, the method 
preferably restores the original value of the mean coefficient value for the block by reversing 
the shifting operation that was conducted during the embedding stage. Specifically, after the 
data bit is extracted, the method preferably reduces the absolute value of the block's mean 

20 coefficient value by the magnitude of the shift quantity, S, used during the embedding process, 
while maintaining the sign of the mean coefficient value. 

Preferably, for a T-bit block, the coefficients in the HL1 or LL1 sub-bands are 
shifted toward 0, back to their pre-embedding values. Consequently, the original IWT 
coefficients are recovered. It is noted that for hidden data extraction and original image 
25 recovering, there is preferably no need to go back to the spatial domain. This feature is very 
convenient for JPEG2000 images. 

In the foregoing, we have assumed that the initial mean value of each block is zero 
or close to zero, or at least is less than the threshold T. The latter is ensured since T is greater 
than the m max as defined in Fomula (2). 

30 Cover Media Restoration 

After all hidden data have been extracted, we can recover the original cover media. 
Two possibilities exist here. The first possibility is that the stego-image has not been subjected 
to any alteration. The other possibility is that the stego-image has undergone some alteration. 
With the first possibility, the original image can preferably be recovered. FIG. 16 is a 
35 flowchart illustrating a method for recovering the original image in accordance with one or 
more preferred embodiments of the prevent invention. It is noted that if the recovered image 
has been altered, the original cover image may not be fully recoverable. 
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5 Experimental Results And Performance Analysis. 

Generally, the bit stream to be embedded is randomly generated using the randint() 
function of Matlab™. (Function randint generates a random scalar that is either zero or one, 
with equal probability.) We applied our algorithm to various images including those in the 
USC SIPI image database, the CorelDraw™ image database, and JPEG2000 test images. Our 

10 proposed algorithm works well for all of the test images. FIG. 17 is a marked medical image, 
and FIG. 18 is a marked JPEG2000 test image (N1A) generated using our proposed algorithm. 
Thus, the same amount of data is embedded into images of the same size. It is noted that the 
problem of salt-and-pepper noise is not incurred with the system and method disclosed herein 
and that the visual quality of stego-images produced by our proposed algorithm is much higher 

15 than that of the prior art. Specifically, the PSNR of the marked medical image (FIG. 17) with 
respect to its unmarked original version is 38.53dB instead of 9.28dB as with the prior art. 
And, for the Nl A JPEG2000 test image, the PSNR is 42.67dB (using the preferred system and 
method disclosed herein) instead of 18.01 dB with the prior art. 

To analysis the performance of the proposed algorithm, we presents the testing 

20 results in detail with the Lena image, the most commonly used image in the image processing 
community, which is an 8-bit grayscale image with a size of 512x512. 

Data Embedding Capacity 

In our algorithm, the data embedding capacity is preferably determined by the block 
size B. The smaller the B is, the more blocks there will be. Consequently, more bits can be 

25 embedded. The table in FIG. 19 gives the relationship between the block size and capacity. 
Error correction coding is preferably used to enhance the robustness of our algorithm. The 
ECC coding scheme used here employs BCH codes [13]. However, other error correction 
coding/decoding methods may be employed. In general, the stronger the error correction 
capability is, the smaller the embedding capacity will be. BCH (15, 1 1) codes were used in this 

30 experimental investigation. The capacity can thus calculated as 

MxN 11 

C . = x— (8) (16) 

capacuy 4xBxB 15 

where M, N are the size of the image, and where B is the block size. In FIG. 19, 
"Capacity with ECC" denotes the capacity determined with the formula above and is 
sometimes referred to as "pure payload," while "Capacity without ECC" indicates the capacity 
3 5 achieved if ECC is not used. 

Visual quality of marked images. 
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5 The visual quality of the stego-images with respect to the original image may be 

measured using the parameter of peak signal to noise ratio (PSNR). 

From FIG. 19, it is seen that use of a smaller block size can provide larger data 

storage capacity. However, the m max , the maximum absolute mean value of IWT coefficient in 

a block which was earlier defined in Formula (2), will be larger. Hence, a smaller block size 
10 generally requires that the shift quantity S be larger, thereby degrading the quality of the 

marked image, causing lower PSNR. The data in FIG. 20 tends to support this observation. 

Indeed, the "Minimum shift value" is defined as the smallest integer that is greater than m max . 

FIGS. 21 and 22 display the marked "Lena" images using the minimum shift values 
for different block sizes. We can see that the visual quality of the marked image of FIG. 21 is 
15 worse than that of FIG. 22. The PSNR of the image of FIG. 22(b) is 9 dB greater than that of 
the image of FIG. 22. 

Using the same block size, different shift values can be applied. Later, we will see 
that the larger the shift value is, the more robust the stego-image becomes. FIG. 23 reveals the 
PSNRs of stego-images with different block sizes and different shift values. We can see that 
20 the price for a larger shift value is a lower PSNR value, i.e. lower image quality. 

Robustness of Hidden Data. 

Different from most lossless watermarking algorithms, the proposed IWT based 
lossless watermarking algorithm is robust to incidental alterations. 

At first, it is robust to JPEG2000 compressions. In experimental study, the marked 

25 images are JPEG2000-lossy-compressed with increasing compression ratios. The robustness 
against JPEG2000 compression is measured by data survival rate, meaning that when the 
resultant data rate after compression is above or equal to the data survival data rate, the hidden 
data can be reliably extracted without error. In other words, the lower the data survival rate is, 
the more robust the stego-image is. 

30 FIG. 24 shows the relationship between the PSNR of a stego-image and the data 

survival rate under different block sizes. It can be observed that in general for a given block 
size B the lower the PSNR, the lower the data survival rate. For a given data survival rate, if a 
larger block size is chosen, the higher PSNR can be achieved which will lead to better image 
quality but lower data embedding capacity. It is obvious that the image visual quality, the 

35 robustness of hidden data, and the embedding capacity are all interrelated and some are 
conflict with one another. We cannot achieve the best performance for all of these aspects 
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5 simultaneously. Generally, for each specific application, an optimum combination of these 

characteristics can be achieved. 

In addition to compression, the proposed watermarking scheme is robust in the face 

of the use of a median filter as well. A median filter with the size of 3 by 3 can be applied to 

the stego-images. Then, hidden data extraction is preferably performed on the filtered stego- 
10 images. FIG. 25 lists the test results in which the number of decoded error bits is measured 

during the data extraction procedure. It is obvious that a larger block size and a larger shift 

value will make the marked image more robust against a median filter. 

Additive Gaussian noise is type of noise frequently encountered in communication 
systems. In our experiments, a Gaussian noise with a zero mean and variance of 0.001 is 

15 added to the marked images. The error bits in hidden data extraction are counted, and FIG. 26 
lists the results. As before, a conclusion can be drawn that, in general, the larger the block size 
the shift value are, the more robust the stego image will be. 

Herein, a new semi-fragile lossless data hiding scheme is disclosed. Data are 
embedded into the integer wavelet domain. The preferred method disclosed herein can 

20 preferably be seamlessly integrated into the JPEG2000 standard. The issues of overflow and 
underflow are addressed herein. The original cover media can be recovered after hidden data 
extraction if the marked image has not been altered. The hidden data are robust to non- 
malicious attacks, e.g. lossy compression, to a certain degree. Performance analysis is 
conducted which shows the advantages of the proposed algorithm. The visual quality of the 

25 marked image is excellent. Salt-and-pepper noise and other artifacts are successfully avoided 
in the stego-images. It can be applied to content-based image authentication and some other 
applications. 

The entire disclosures of the following documents are all hereby incorporated 
herein by reference. 
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Although the invention herein has been described with reference to particular 
embodiments, it is to be understood that these embodiments are merely illustrative of the 
principles and applications of the present invention. It is therefore to be understood that 
35 numerous modifications may be made to the illustrative embodiments and that other 
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5 arrangements may be devised without departing from the spirit and scope of the present 
invention as defined by the appended claims. 
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